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Abstract 

The notions of universality and completeness are central in the theories of 
computation and computational complexity. However, proving lower bounds 
and necessary conditions remains hard in most of the cases. In this article, 
we introduce necessary conditions for a cellular automaton to be "universal" , 
according to a precise notion of simulation, related both to the dynamics of 
cellular automata and to their computational power. This notion of simulation 
relies on simple operations of space-time rescaling and it is intrinsic to the model 
of cellular automata. Intrinsic universality, the derived notion, is stronger than 
Turing universality, but more uniform, and easier to define and study. 

Our approach builds upon the notion of communication complexity, which 
was primarily designed to study parallel programs, and thus is, as we show in 
this article, particulary well suited to the study of cellular automata: it allowed 
us to show, by studying natural problems on the dynamics of cellular automata, 
that several classes of cellular automata, as well as many natural (elementary) 
examples, were not intrinsically universal. 

Keywords: cellular automata, communication complexity, intrinsic 
universality. 



1. Introduction 

Since the pioneering work of J. von Neumman llSlI . universality in cellular 
automata (CA) has received a lot of attention (see [12[ for a survey). Histor- 
ically, the notion of universality used for CA was more or less an adaptation 
of the classical Turing-universality. Later, a stronger notion called intrinsic 
universality was proposed: A CA is intrinsically universal if it is able to simu- 
late any other CA [sl, 0, [13] through a uniform and regular encoding based on 
rescaling. 

This definition of intrinsic universality may seem very restrictive. However, 
it can be very common among natural families of C A [l| , and allows a complete 
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and precise formalization of the notion of universality H. As we are going to see, 
this preciseness, and the robustness of this definition, allows for concrete proofs 
of negative results and lower bounds. 

Indeed, in this paper we will explain how to rule out particular elementary 
cellular automata, as well as whole well-known classes of cellular automata, 
from being intrinsically universal, using the elegant framework of communication 
complexity. 

In Section [5] we give the basic definitions. One of the key definitions is the 
following: Given a traditional computational problem V with an arbitrary input 
w, we can split the input into two subwords wi and W2\ therefore, we can refer 
to the "comunication complexity" of such problem {wi is given to Alice while 
W2 is given to Bob). 

In Section [3] we introduce a family of "canonical problems" concerning vari- 
ous aspects of the dynamics of a given CA. In other words, for any CA F and 
any prototype problem V , we consider the problem Vf- 

In Section |4] we explain how to infer properties of F from the study of 
the communication complexity of Vf- More precisely, we prove that if the 
communication complexity of one of our canonical problem Vf is not maximal, 
then F is not intrinsically universal. In other words, we are introducing a 
powerful tool for ruling out CA from being intrinsically universal. We conclude 
that linear, expansive and reversible CA are not intrinsically universal. We 
also show the uncomparability of our three canonical problems: none of them 
is sufficient to discard all non-universal cellular automata, and none of them is 
stronger than any other. 

In Section[S]we explain clearly why the communication complexity approach 
appears to be a promising tool for ruling out CA from being intrinsically univer- 
sal. More precisely, we prove computational intractability results about prob- 
lems that our framework considers very simple. 

Finally, in Section IH] we use our results to prove that a few concrete ele- 
mentary CA are not intrinsically universal. Although looking at several space- 
time diagrams of these automata might give a strong intuition about their non- 
universality, we stress that producing complete formal proofs for such a negative 
result is a difficult task and, as far as we know, had never been done before. 

2. Basic definitions 

2.1. Communication complexity 

Communication complexity is a notion introduced by A. C.-C. Yao in [l6j , 
and designed at first for lower-bounding the amount of communication needed in 
distributed algorithms. In that model he considered two players, namely Alice 
and Bob, both with arbitrary computational power and communicating to each 
other in order to collaboratively decide the value of a given function. More 
precisely, for a function (p : X xY ^ the question is "how much information 
do they need to exchange, in the worst case, in order to compute 4){x, y), with 
Alice knowing only x and Bob only y" . 



^ There is actually no consensus on the formal definition of Turing-universality in CA (see Q 
for a discussion about encoding/decoding problems). 
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This communication problem is solved by a protocol, which specifies, at 
each step of the communication between Alice and Bob, who speaks (Alice or 
Bob), and what she/he says (a bit, or 1), as a function of her/his respective 
input. This simple framework, and some of its variants we discuss in this article, 
appears to be promising for studying CA. 

A protocol V over a domain X x Y with range Z is a binary tree where 
each internal node v is labeled either by a map : X ^ {0, 1} or by a map 
by : Y —i' {0,1}, and each leaf v is labeled either by a map : X ^ Z ot hy a 
map By : Y Z . 

The value of protocol V on input {x,y) E X xY is given by Ay{x) (or By{y)) 
where Ay (or By ) is the label of the leaf reached by walking on the tree from 
the root, turning left if ay{x) = (or by{y) = 0), and right otherwise. We say 
that a protocol computes a function : A x y ^ Z if for any {x, y) £ X x Y, 
its value on input {x, y) is 4)(x, y). 

Intuitively, each internal node specifics a bit to be communicated either by 
Alice or by Bob, whereas at the leaves either Alice or Bob determines the value 
of (j) when she/he has received enough information from the other party. 

In our formalism, we do not ask both Alice and Bob to be able to give the 
final value. We do so in order to consider protocols where communication is 
unidirectional. 

We denote by cc(0) the (deterministic) communication complexity of a func- 
tion 4) : X xY ^ Z . It is the minimal cost of a protocol, over all protocols 
computing </), where the cost of a protocol is the depth of its corresponding tree. 

One approach for proving lower bounds on the communication complexity 
of an arbitrary function (j) is based on the so-called fooling sets (for a deeper 
presentation of this theory we refer to jl| ) . 

Definition 1. Given a function (f) : X x Y ^ Z, a set S C X x Y is a fooling 
set for (j) if there exists z £ Z with: 

1. y{x,y) e S, 4>{x,y) = z, 

2. y{xi,yi) G S,y{x2,y2) e S, either (/)(xi,y2) ^ z or (I){x2,yi) ^ z. 
The usefulness of fooling sets is given by the following lemma (see 3). 

Lemma 1. // S is a fooling set of size m for (j) then cc{(j)) > log2(m). 

In addition to ad hoc fooling set constructions, we will use the following 
classical lower bounds on communication complexity (the proofs appear in [8] ) . 

Proposition 1. Let n > 1 be fixed. Let (p^q, 0ip o,nd (/)dis,i be the functions 
"equality", "inner product" and "disjointness" defined from {0,1}" x {0,1}" to 



{0, 1} by: 




otherwise. 





The following lower bounds hold: 
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• cc((/>eq) > n. 

• cc((?iip) > n. 

• cc{(j)ois}) > n. 

2.2. Splitting the input of computational problems 

Let us consider now classical computational input-output problems. In this 
work we will only encounter problems of the form V : Q* ^ Z, whose inputs 
are words over some alphabet Q and outputs are elements of a finite set Z. 
Moreover, wc will always have Z = Q oi Z = {0^1} as output sets. 

Given such type of problem V, we define, for any n, its restriction to words 
of length n; i.e, we consider the restricted problem V\n ■ Q" Z. 

The key idea of the communication approach is to split the input into two 
parts: For any 1 < i < (n — 1), we define V\n : x — >■ Z. More precisely, 
for every x G Q^,y S we have ■p|^(x,y) = Vlnixy). Then, we can 

consider the communication complexity cc(P|^) of the ith split function T-"]^. 
Of course the choice of i matters and can alter the corresponding communication 
complexity. Since we don't want to rely on an arbitrary choice, we consider the 
worst case. This yields the following definition: 

Definition 2. Let V : Q* ^ Z he a problem. The communication complexity 
ofV, denoted GC{V), is the function: 

max ccfPI!,). 

l<i<n-l ^ '"-^ 

2.3. Cellular automata 

In this paper we are always going to consider one-dimensional CA. A CA 

is defined by its local rule / : Q'^^'^^ Q (where Q corresponds to the set of 
states and r denotes the radius of the local rule). For any n> 2r + 1, we extend 
/ : _j. g to the more general f : Q"" ^ Q"-^'' by 

f{ui ■ ■ ■ Un) = f{ui ■ ■ ■ U2r+l) ■ ■ ■ /(Wn-2r ' ' ' Un)- 

Moreover, for every 1 < t < [{n — l)/2rj, we define the f-steps local iteration 
as /* : Q" ^ Q"-2 '^ * by 

f = f 

f\ui ■■■Un) = /(.f*"H"l • • • Un-2r) ' ' ' /*"^(W2r+l ' ' ' 

We also define f*:Q*^ Q* by 

= /L^J(n). 

Intuitively, /* applied on u consist in iterating / as long as possible (until 
ending up with a word too short for /). The result is a word of length at most 
2r (depending on |u| mod 2r). 

We denote hy F : ^ the global rule induced by / following the 
classical definition: 

= f{Cz-r,■■■,Cz+r)■ 
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Finally, we denote by F* : — > the i-step iteration of the global function 

F. 

A global function F can be represented by different local functions. All 
properties considered in this paper depend only on F and are not sensitive to 
the choice of a particular local function. However, to avoid useless formalism, 
we will use the following notion of canonical local representation: (/, r) is the 
canonical local representation of F if / has radius r and it is the local function 
of smallest radius having F as its associated global function. 

Throughout this work we are going to refer to the CA F with (/, r) being 
its canonical local representation. 



3. The three canonical communication problems 

In this section we define the three "problem schemes" on which we are go- 
ing to apply the communication complexity approach. Before entering into 
details, we stress that this set of problems tackles various dynamical aspects of 
CA: Transient, periodic and asymptotic regime starting respectively from finite, 
cyclic, or ultimately periodic configurations. Moreover, algorithmically speak- 
ing, they are also very different since they belong respectively to the classes P, 
PSPACE, and 11]' (and can be complete for these classes as we will see in this 
section). 

Thus, they form an interesting set of prototype problems. 
3.1. Prediction 

The prediction problem consists in determining the far future of a cell given 
the state of sufficiently many cells around it. 

Definition 3. Let F be a CA. The problem. Predj? : Q* ^ Q is defined as 
follows: 

Predf(u) = (/*(w))p 

where {f,r) is the canonical local representation of F while the "{f*{u)) no- 
tation means that we take the first letter of the word f* (u) , which has length at 
most 2r. 

Clearly, this problem is in DTlME(ri^), and, as we have already said before, 
we can also view PREDi? as a communication problem (see Figure [Ij: Given 
an initial configuration as input, we split the initial configuration between Alice 
and Bob, and ask for the final value computed by F on this input configuration. 



as represented in Figure 1(b) 



More precisely, for every 1 < i < (n — 1), PredfI^ : x Q"^' Q is such 
that PREDi?|5j(x, y) = {f*{xy))i. This function PREDi?|5j can be represented 
as a IQI' X matrix. In other words, we give i states to Alice (rows) and 

n — i states to Bob (columns); i.e. X ~ and Y = Q"~^. We denote by 
Mp^ such a matrix. In the examples of Figure [21 we have n = 2« + 1 = 13 and 
n = 2i -I- 1 = 15 (for the elementary CA Rule 178). 

Remark. We can consider the more restricted one-round communication com- 
plexity. In this setting only one party (either Alice or Bob) is allowed to send 
information. This restriction is justified by the fact that, according to a theorem 
of 0/; by simply counting the number of different rows or columns of a certain 
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I ] Result 
Alice Bob 



(a) A space-time dia- 
gram of Rule 110. 



(b) The commmunica- 
tion interpretation of 
PREDjTik,. 



Figure 1: Problem PaEDi?jj, 




matrix we obtain the exact one-round communication complexity of the func- 
tion. In our framework, the one round communication complexity o/PREDj^j^ 
corresponds to the minimum between the number of different rows and different 
columns of Mp'^ . Therefore, performing computational experiments in order to 
infer the one-round communication complexity of PREDp\l^, becomes an easy 
task. 

Recall that, given a CA F, the communication complexity of PREDi? is 
defined as: 



CC (Predf) = 1-^ max cc(PREDi?|^) ■ 

l<i<n— 1 

Remark. In the above definition of Predp , we choose a canonical local repre- 
sentation {f,r) for the CA F. Replacing f by another valid local representation 
can change the problem and its communication complexity. However this change 
would only introduce a multiplicative factor and therefore would not alter the 
main point of this paper (Section \4.3^ . 

Now we show that some well-known properties of CA induce small upper 
bounds for the communication complexity of the prediction problem. The re- 
sults below are adaptations of ideas of [4] to the formalism adopted in the present 
paper. 

Proposition 2. Let F be any CA and (/, r) be its canonical local representation. 
If there is a function 5 : N — )• N such that /" depends on only g{n) cells, then 
CC(PreDf) < gin)/2. 
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In the work of M. Sablik [13|, CA which are equicontinuous in some direction 
are considered. Following Theorem 4.3 of [3], they have a bounded number 
of dependant cells (i.e, a bounded function g(n)). A well known example of 
such CA are the nilpotent CA (a CA is nilpotent if it converges to a unique 
configuration from any initial configuration, or equivalently, if is a constant 
function for any large enough t). 

Corollary 1. If F is an equicontinuous CA in some direction then 

CC(Predf) e 0(1). 

Another set of CA with that property is the set of linear CA. A CA F with 
state set S is linear if there is an operator ® such that {S, ®) is a semi-group 
with neutral element e and for all configurations c and c' we have: 

F{cWc') = F{c)WF{c'), 

where ® denotes the uniform (cell-by-ccU) extension of 0. 

Proposition 3. If F is a linear CA then CC (PREDi?) e 0(1) 

Proof. The proof appears in Q in a different setting. The idea is that there is a 
simple one-round protocol to compute linear functions: Alice and Bob can each 
compute on their own the image the function would produce assuming the other 
party has only the neutral element as input; then Alice or Bob communicate 
this result to the other who can answer the final result by linearity. □ 

3.2. Invasion 

Let _F be a CA and let u be a given word. Roughly, the problem Inv^ is 
defined as follows: Given an input word w, we define the M-periodic configuration 
Pu on the one hand, and the configuration p„(w) obtained by putting the word 
w at the origin over p^ on the other hand; the invasion problem consists in 
determining whether the differences between pu and Pu{w) will expand to an 
infinite width as time tends to infinity (hatched surface on Figure ^ . 

As we show in Proposition [521 the general case is, from the point of view of 
classical algorithmic theory, undecidable. 



u 


u 


Ahce ' Bob 


u 


u 



Figure 3: The INVASION problem 

Now we give formal definitions. 

Definition 4. Let u — ui . . . ui be a finite word. Let pu he such that for all 
i Pu[i] — u[i mod I]. 
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• we consider the ultimately periodic orbit {F^{pu))^ os the reference orbit; 

• for each xi, . . . ,Xn G Q, we define the configuration . . . , a;„) ob- 
tained by modifying p„ as follows: 



{Pu)z for z < or z > n + 1, 
Xz otherwise. 



• for each t, we denote 6i(t) and Sr{t) the lefmost and rightmost differences 
between the t*^ images of Pu and Pu{xi, . . . , 

Si{t) = mm{z : F\pu)z ^ . . ,a;„)) J, 

Srit) = maxjz : F\pu)z ^ F^{pu{xi, . . . ,x„))^}. 

• then Inv^(xi . . .a;„) equals 1 if Sr{t) — 6i{t) — >t oo and otherwise. 

As explained before, we associate to any F and m, the communication com- 
plexity of Inv^ defined as CC (Inv^). 

Some CA have by nature a trivial invasion complexity because their dynam- 
ics consists in propagating errors systematically. This is the case of (positively) 
expansive CA. Recall that F is (positively) expansive if there is some e > such 
that: 

yx,y, x^y^ 3t, d{F\x),F\y)) > e 
where d is the Cantor distance. 

Proposition 4. Let F be a positively expansive CA. Then for all u we have 
CC(lNV^) = 1. 

Proof. Fix any u and consider any (xi, . . . , Xn) such that p„(a;i, . . . , a;„) 7^ p„. 
By classical results of P. Kurka 01, there is a positive constant a (average 
propagation speed) such that Si{t) < —at and Sr{t) > at. Therefore, invasion 
occurs if and only if: 

Pu{xi, ...,Xn) y^pu. 

Testing this condition can be done with only 1 bit of communication: Either 
Alice or Bob communicates whether she (or he) sees any difference between 
her (or his) input and the corresponding part of p„; then the other party can 
answer. The proposition follows. □ 

3.3. Cycle length 

For this last problem, we consider spatially periodic configurations. Since 
there are only a finite number of such configurations of a given period size, 
and the size of the period does not grow with time, then clearly the evolu- 
tion becomes periodic (in time) after a certain number of steps (see Figure [3] 
where successive steps are represented by successive concentric circles). Roughly 
speaking, the cycle problem consists in determining whether the length of this 
ultimate (temporal) period is small, starting from a given (spatially) periodic 
initial configuration. The formal definition follows. 
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Figure 4: The Cycle problem on elementary CA Rule 33. Since the configurations are cyclic, 
we can represent one configuration on a circle. Time goes from the inner circle to outer circles, 
zeros are white, and ones are black. For instance, the initial configuration - on the innermost 
circle - is 011011. After one step, it becomes 100100. 



Definition 5. Let F be a CA and let k > 1. For any u G Q* we denote by A(w) 
the length of the ultimate period of the orbit of configuration Pu under F: 

\{u) = mm{p : 3to,Vt > to,i^*(p«) = F'+P{pu)]. 

The problem Cycle^ is then defined by: 

1 otherwise. 

One of the interests of the cycle length problem lies in the following com- 
plexity upper bound for reversible CA. 

Proposition 5. Let F be any reversible CA. Then, for any fixed k, we have: 

CC fCYCLE|.) G 0(1). 



Proof. For a reversible CA, orbits of periodic configurations are not only ulti- 
mately periodic but also periodic. More precisely, for any periodic configuration 
c, the cycle length starting from c is less than k if and only if: 

3t<k: F\c) = c. 

Thus, Alice and Bob can simply simulate the automaton for A; steps, then 
check if a configuration repeats during these steps : this can be done with 
Ak ■ r ■ [1 + log Q\ bits, to transmit the cells next to the border between Alice 
and Bob's respective parts, then one bit for Alice to tell Bob if a configuration 
appeared twice during the k steps. 

□ 



4. The three corresponding necessary conditions for intrinsic univer- 
sality 

In this section we show that intrinsic universality implies that the commu- 
nication complexity of the three canonical problems described above must be 
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maximal. Before giving precise definitions, recall that a CA is intrinsically uni- 
versal if it is able to simulate any other CA. Our approach with communication 
complexity proceeds in two steps: 

• we show that the simulation of by G implies a reduction from any 
canonical problem for F to the corresponding problem for G in such a way 
that the communication complexity is preserved (up to some distortions 
involving only multiplicative factors); 

• we show the existence of maximal communication complexity CA for each 
of the canonical problems. 

Before developing these two steps, we give formal definitions for simulations 
and intrinsic universality. 

4-.1- Simulations and universality 

The base ingredient is the relation of sub-automaton. A CA F is a sub- 
automaton of a CA G, denoted hy F Q G, if there is an injective map i from 
Qi? to Qg such that To F ~ G o I, where I : Qf ~^ Qg denotes the uniform 
extension of l. 

A CA F simulates a CA G if some resettling of G is a sub-automaton of 
some resettling of F. The ingredients of the rescalings are simple: packing cells 
into blocks, iterating the rule and composing with a translation. Formally, 
given any state set Q and any to > 1, we define the bijective packing map 
h^:Q^^ {Q"'f by: 

Vz e Z : (bm{c)) (z) — (^c{mz), . . . , c{mz + rn — 1)) 

for all c G Q^. The rcscaling p<™'t-^^> of F by parameters m (packing), t > 1 
(iterating) and z e Z (shifting) is the CA of state set and global rule: 

bm ° cr^ o F* o b~^ . 

The fact that the above function is the global rule of a cellular automaton 
follows from Curtis-Lyndon-Hedlund theorem |6| because it is continuous and 
commutes with translations. With these definitions, we say that G simulates 
F, denoted ^ G, if there are rescaling parameters toi, TO2, ti, ^2, zi and Z2 
such that F<'"i'*i'^i> C G<'"=^'*^'^^>. 

We can now naturally define the notion of universality associated to this 
simulation relation. 

Definition 6. F is intrinsically universal if for all G it holds that G =4 F- F 
is reversible universal if for all reversible G it holds that G ^ F. 

We consider the following relation of comparison between functions from N 
to N: 

(j^i ^ h 3a, /3,7,5 > l,Vn e N : (?!)i(an) < /3(?!)2(7n) + (5. 

Remark. All the functions we will compare by -< are in 0(n) since they come 
from a communication complexity problem. Moreover, the set of such functions 
that are in fl(n) form an equivttlence clttss for -<. Although we sometimes give 
more precise bounds, most of the paper focuses on whether or not some function 
belongs to this class. 
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Proposition 6. If F 4 G then CC (PREDir) -< CC (Predg). 

Proof. We successively consider each "ingredient" involved in the simulation 
relation. 

Sub-automaton: if C G then each vaHd protocol to compute PredgIJj 
is also a valid protocol to compute iterations of Pred/t-I^ (up to state 
renaming) . 

Iterating: We have CC (PREDir*) e 6(CC (Predf)). In fact, if we have a 

protocol for the prediction problem of _F* - which is an automaton of 
radius t-r- then we can use it to predict : on a configuration x of size 
n, we use the protocol to predict the result of iterating [^J times F*, 
which gives a configuration of size at most t-r~l. To do this, we just use 
the protocol at most t-r—1 times to predict each cell of this configuration, 
then Alice or Bob conclude by simulating the automaton directly. 

The other direction is even simpler: a protocol for Predf can be used 
directly for PreDj?* by just slightly reducing the input of Bob. 

Shifting: This operation only affects the splitting of inputs. Since we always 
take in each case the splitting of maximum complexity, this has no influ- 
ence on the final complexity function. 

Packing: let be any CA and n be fixed. Consider the problem PREDir<m,i,o>|^ 
for some j. Now consider any sequence of valid protocols (Pi), one for each 
problem PREDir jj^^. It follows from the the definition of packing maps 
that PREDir<m,i,o> 1^ can be solved by applying m suitably chosen proto- 
cols in the sequence {Pi). Therefore 

CC (PREDir<m,i,o>)(n) < m • CC (PREDir)(n) 

Reciprocally, one has for all n: 

CC(PREDir)(n) < CC (PREDj<™,i,o>)([n/m]) +TO 

where the additional constant m is used to deal with input splittings of 
PREDir|„ which have no equivalent in PREDj<m,i,o> 1 1-„/^] because they 
do not cut the input at a position which is multiple of m. 

Therefore we have: CC (PREDi?) -< PREDp<^,t,z> , PREDi7'<™,t,^> -< CC(PREDir) 
and if F C G then CC (PREDi?) -< Predq. The proposition follows. □ 

The following result shows that the invasion complexity is increasing with 
respect to simulations. 

Proposition 7. If F ^ G then for all u there is v such that 

CC (INV^) -< CC (INV^). 

Proof. The simulation relation =<; is such that iiltimatcly periodic configurations 
of F are converted into ultimately periodic configurations of G. Hence, the 
invasion problem of F reduces to the invasion problem of G. More precisely, it 
is sufficient to check the following properties, each dealing with an aspect of the 
simulation relation =<;; 
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for any CA F, any u and any rescaling parameters m, t, z, we have 



CC (INV^) -< CC (INV' 



p<m,t,z> 



) 



where U is the period of the configuration bm{Pu)] 

• ifF^G then, for any u, CC (Inv^) -< CC (Inv^); 

• for any CA F, any rescahng parameters m, t, z, any U (over the alphabet 
of F<™'*'^>) CC (lNV^<„,t,.>) ^ CC (INV^) where u is the period of the 
configuration b^{pu)- 

The result follows by composition of the 3 properties above. □ 

Finally, we show a similar result for the cycle length problem. The problem 

is parametrized by an integer k and the following proposition establishes that 
for suitable but arbitrary large values of this parameter the complexity of the 
problem is conserved. 

Proposition 8. If F ^ G then for all ko there is k and k' such that: 

• k > ko and k' > ko; 



Proof. The effect of rescaling transformations on cyclic orbits of periodic config- 
urations is to change the (spatial) period length as well as the (temporal) cycle 
length. More precisely, we have: 




ifFQG then, for any k, CC (Cycle^ ) ^ CC (Cycle^ 



for any k, 



- CC 



- CC 




and 



for any t 



and any k we have: 




for any t 



and any k such that k mod t = we have: 




The proposition follows. 



□ 
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4-2. Existence of CA with maximal complexity 

This section is devoted to the foUowing existence result. 

Proposition 9. 

1. There exists a reversible CA F and a word u with CC (Inv^) € 

2. There exists a reversible CA F with CC (PREDi?) e Vt{n). 

3. There exists a CA F s.t. for any fc > 1, CC (^Cycle|^ e Vt{n). 

We now define the reversible C A of assertion 2 of Proposition [HI which we 
call G in the sequel. It is made of 3 layers: 

• flag layer Qf = {0,1}, 

• circulation layer Qc — {W} U {0, 1} x {0, 1}, 

• test layer Qt ^ {0, 1} x {0, 1}. 

The flag layer is simply the identity over Q f . The circulation layer does not 
depend on other layers and has the following behaviour. 



circulation 
flag 
test 



Xi 











1 



circulation 
flag 
test 



-^TTr/*0 Xl 




Xn ^ 


^^^^0 Vn 


Vk 


2/1 0^^^ 





1 






110 1 



10 11 



Figure 5: Above: initial configuration. Below: the configuration k steps later. 



• normal states in {0, 1} x {0, 1} represent two sub-layers (top and bottom) 
and, if no W state is in the neighbourhood, the top sub-layer simply shifts 
to the right and the bottom sub-layer simply shifts to the left. 

• W states are walls: They stay unchanged forever. Moreover, a normal cell 
on the right of a wall has the following behaviour: The top value shifts to 
the right and the bottom value goes to the top. A normal cell on the left 
of a wall has a symmetric behaviour: The bottom value shifts to the left 
and the top value goes to the bottom. See figure [5] 

Finally, the test layer is made of two sub layers (top and bottom) wich are 
independant. The top layer does the following: 

• if the flag layer of the cell is 1 and if the circulation layer contains the 
state (1,1) then invert bit and shift right; 

• in any other case, simply shift right. 
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The bottom sub- layer does the same but replace right by left. 
Proof of Proposition 

1. We first show that G defined above has the properties of assertion 1 of 
the proposition. First, it is reversible: the flag and circulation layers are 
themselves reversible, and the knowledge of these two layers makes the 
flag layer reversible too. 

Now let qo be the state where flag layer is 0, circulation layer is (0, 0) and 
the test layer is (0, 0). Consider input bits xi, . . . ,Xn on the one hand and 
2/1, . . . ,?/„ on the other hand. Let Xi be the state with flag layer 0, test 
layer (0, 0) and circulation layer {xi,0). Similarily let Yi be the state with 
flag layer 0, test layer (0,0) and circulation layer (0, yi). Let M be the 
state of flag layer 0, circulation layer W and test layer (0, 0). Finally let 
T be the state of flag layer 1, circulation layer (0, 0) and test layer (0, 0). 
Consider the configuration C{xi, . . . , j/i, . . . , j/„): 

"(70 Af X„ • • • Xi T Yi • • • r„ M q'^ 

We can consider this configuration as an instance of the invasion problem 
Inv^2,i+3 where u = q^. The only possible invasion in such an instance 
comes from the test layer. It follows from the definition of G that there is 
invasion on this instance if and only if 

3^i,Xi = j/i = 1. 

Hence, the disjointness problem reduces to the invasion problem through 
such instances. Using proposition [H we conclude that CC (InVq ) G Vl{n). 

2. Assertion 2 of the proposition can be proven with a CA F simpler than 
G, but using similar ideas. F has radius 1 and its state set is the product 
of 3 components: 

• left circulation with state set {0, 1}, 

• right circulation with state set {0, 1}, 

• test with state set {0, 1}. 
The behaviour is the following: 

• each of the left and right circulation components are independent 
of the other components and consists in simple shift (left and right 
respectively), 

• the test component simply fiips its value if both left and right circu- 
lation components have value 1 and stays unchanged else. 

F is clearly reversible (circulation layers are independent shifts and test 
layer is reversible knowing other components). Moreover, the inner prod- 
uct problem reduces to the prediction problem of F. Indeed, for any 
x,y e {0,1}" consider the word 

where Xi is the state equal to Xi on the right circulation component and 
elsewhere, Yi is the state equal to yi on the left circulation component and 
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else, and Z is the state equal to everywhere. It follows from definition 
of F that 

PREDi?|„(ii;) = 1 ■^=^> ^a^ij/i mod 2 = 1. 

proposition [1] implies that CC (Predf) e f2(n). 
3. We use the problem DISJ to build a hard Cycle problem. The idea is 
that if Alice and Bob receive two disjoint sets as their inputs, our CA will 
check DISJ forever. Otherwise it will erase all the tape, leaving a uniform, 
1-periodic, configuration. 

We use three layers in this construction, let us call the corresponding rules 
Fi, Fi and F3. They are all of radius one, and all use the same set of states 
{0, 1,-?^}. The K state is used to erase all three tapes: thus, if it appears 
on any component, it spreads on all three. 

On (local) configurations not involving i^T, F\ is a simple left shift, and 
F2 a simple right shift. We use F-j, as a control layer: we need to check if 
the two other components represent two disjoint sets. The correponding 
bitwise operation is: 

n 



Xl 



_0__0_ 

Vn 



yi 



(Ati -(^» A yO) = ? 



Figure 6: An automaton with a hard Cycle problem, and an easy Inv. 

This corresponds to the following (partial) rule: 




We consider a cyclic configuration containing an input for Alice on the 
first layer, and an input for Bob on the second layer, (as in Figure H]), and 
a third layer everywhere empty, except for a central "test" state, actually 
performing the tests. While the test value is 1, the tests go on. There are 
three cases: 
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• If both Alice and Bob receive the empty set, the configuration is 1- 
periodic, but AUce and Bob can detect this case with a single bit of 
communication. 

• Else, since the tape is cyclic, if Ar=i ~^{xi^Vi) ^ 1; then the test goes 
on forever, producing a (temporal) cycle of length ri(n), because in 
this case, at least one Xi or one yi is 1, and it is separated from the 
next 1 (possibly itself !) by at least the 2n + 1 zeros depicted on 
figure m 

• Otherwise, the test becomes at some step and a spreading state is 
generated, which erases all the layers in both directions and produce 
a (temporal) cycle of length 1. 

Thus, except in the case where both sets are empty, this is an "implemen- 
tation" of the DISJ problem, shown in f2(n) for several variants of commu- 
nication complexity in 8]. This proves that this automaton can embed 
an ri(n) communication problem in some of its configurations, which is 
enough to prove that its Cycle problem is hard. 

□ 

Remark. We prove in section l4-4-4\ ihat the last construction of proposition\^ 
has an Inv problem in 0(1). 

4-. 3. Necessary conditions for universality 

The following corollary is the main tool provided by this paper to prove 
negative results about (intrinsic) universality. 

Corollary 2. Let F be an intrinsically universal CA. Then it holds that: 

1. there exists u s.t. CC (Inv^) e VL{n), 

2. CC(Predf) G VL{n), 

3. there exists k s.t. CC (^CvCLEp^ £ fl{n). 

Moreover, if F is only reversible-universal, then\^ and{^ still holds. 

Proof. It follows from Propositions|ni[Zla'nd[H]on the one hand, and Proposition[5] 
on the other hand. □ 

A first application of this corollary to the complexity upper-bounds presented 
in Section [3] yields the following necessary conditions for universality. The first 



proofs of these results appears in IJ]. However, our approach allows us to 



formulate much simpler and more elegant proofs. 

Corollary 3. Let F be an intrinsically universal CA, then F cannot be; 

• neither expansive 

• nor linear 

• nor reversible. 

Moreover, a reversible universal CA can not be expansive or linear. 
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4-. 4- Uncomparability of the three conditions 

Here we show the "orthogonahty" of our three problems: For any pair of 
problems (Vq^Vi), we exhibit two CA, A and B, such that: 

• CC (r^) G o(CC ("Pi^)), in which case we say that A is "hard" for Vi 
and "easy" for Vq. 

• CC (-Pf ) e o(CC (-P^)), in which case we say that B is "hard" for Vq and 
"easy" for Vi- 

This shows that our three necessary conditions for intrinsic universality are 
really necessary: no condition is stronger than any other. 

44- 1- ^ CA easy for Pred and hard for Inv 

The idea is to embed an equality test (more precisely, a palindrom test) 
launching signals invading the whole configuration, while keeping the prediction 
problem easy; see [1] or proposition [T] to see why this problem requires n{n) 
communicated bits. The idea is to use two components that both stay easy for 
Pred: one with tests that do not alter the component, and one with signals, 
moving quickly out of the way: 

1. The first layer performs tests for equality, as described below, and initialy 
contains a word over the alphabet Fi = { , 1 , , 1 ,T,0i,_ft'i}. On 
figure [71 this layer is drawn with full lines. 

The dynamic of the first layer is simple : 7^ states shift right, and IT 
states shift left. T states do not move, and 0i are spreading. 

2. A layer with an automaton invading the configuration from a seed. We 
need five states on this layer: F2 = {3,^2,^,-^, K2}. We describe the 
rule below. On figure [71 this layer is drawn dashed. 

The rule here is even simpler: 02 states do not move, — >■ states shift right, 
■s— states shift left. State s represents a signal "seed" , meaning that if it 
appears once, it disappears on the next step, and changes into a — > signal 
on its right, and a signal on its left. 

We add a few rules that allow to verify the well-formedness of configurations. 
This allows us to ensure that there can be only one T state on the first layer, 
and that signals on the second layer never cross. States Ki and K2 are used 
for this purpose: if one of them appears somewhere, they both spread on both 
layers, thus erasing the whole configuration: the Pred problem becomes trivial. 

• If a tr state is found immediately next to an state, then Ki and K2 
are both raised. 

• If a — > signal is found in the same cell as an ~(^, or a -s— in the same cell as 
an tr, then Ki and K2 are raised. This ensures that signals on the second 
layer never cross. 

Moreover, we introduce another rule to perform the equality test: when the 
test is negative (i.e. a T state has an on its left, a V on. its right, and x y), 
then we place an s state on the second layer : 
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Figure 7: A CA easy for Pred and hard for Inv 
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Proposition 10. The CA F described above is such that: 

1. CC(Predf) e 0(1), 

2. there is u such that CC (Inv^) e i}{n). 

Proof. 1. A protocol for Pred needs to predict the content of both layers: 
if the configuration is not well-formed, then a Ki state will appear some- 
where and this is easy (and it can be checked locally by Alice and Bob). 
Else : 

• On the first layer, the result will always be the result of a shift if the 
initial configuration contains only or Ir" states, or if the T state 
is not the central cell of the configuration, and a T state else. This 
requires a constant number of communicated bits. 

• On the second layer, there are four - possibly overlapping - possibil- 
ities: 

— If the leftmost state of Alice's differs from the rightmost state of 
Bob's, and the central cell is a T state, the result is an s. 

— If the T state is not the central cell, but somewhere else in the 
left part, and the corresponding word is not a palindrom, then a 
— is launched (see figure [5]). 

— If the initial configuration contained an s or a —t- in its leftmost 
cell, a — arrives to the top of the triangle. 

— Else, the result is a 02. 

All of these can be checked locally and communicated between Alice 
and Bob within a constant number of bits. 

2. Now we need to find a set of hard instances for the Inv problem: with a 
background word u, with 0^ on both layers, and an initial configurations 
of the form (0,1 )"T( , 1 )" on the first layer, and 0* on the second, 
we reduce the equality problem to Inv. 

□ 
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Figure 8: A CA easy for Pred and hard for Inv 

4-4-2- A CA easy for Cycle and hard for I^y 

We can reuse the construction of paragraph 14.4.11 we aheady know that 
it is hard for Inv. What we need to do is to modify the rule so that on the 
second layer, when a — > signal crosses a signal, they both disappear and the 
resulting state is a 02. This ensures that on cyclic configurations, even if signals 
are "raised" somewhere, they are "caught" by the cyclicity. The rest of the 
discussion is essentially the same as in paragraph 14.4. 1[ and we can conclude 
easily that the orbits of configurations containing at least one T, or of ill-formed 
configurations, are always 1-periodic; the Cycle problem can be decided with 
no communication. In all other cases, the dynamic is nothing more than a shift: 
the protocol from [5] can be used. 

A CA easy for Pred, and hard for Cycle 

We can use once again (and for the last time) quite the same construction 
as in paragraph 14.4.11 We modify it to launch only one signal (in only one 
direction) when an error appears. Thus, as proven in section 14.4.11 the Pred 
problem remains easy. Now we need to prove that the Cycle problem is hard, 
but for this we can choose the instances on purpose. 

If no test fails, the configuration will be 1-periodic: When all the tests have 
been done, the configuration is uniformly empty, except for the T states, and 
then nothing more happens. Otherwise, a signal will be launched. We need to 
show that the period of the configuration is then in J7(n). But we can notice 
that a contiguous portion of Q,{n) cells can not have any signal (see Figure [S]). 
Therefore, the period of the configuration is 51(n) if and only if an error occurs. 

4-.Jf.4- A CA easy for Inv and hard for Cycle 

As promised in remark [4. 2 1 we now prove a protocol for the Inv problem of 
the rule described there: 

Proposition 11. The CA F described in the proof of proposition\^is such that: 

Vu, CC (INV^) e 0(1) 

Proof. Let u be any word over the alphabet for F. First, if the orbit of p„ con- 
tains a spreading state, then Pu{w) quickly becomes uniform with the spread- 
ing state everywhere, independently from w. Else, the discussion is a little 
more subtle. Let us note the periodic background Pu = ipunPu^^Pui), and let 
w — (wi,u'2, W3) the input, split between Alice and Bob. 
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<n/2 signals 




n/4 : No signals here 



Figure 9: A CAeasy for Pred and hard for Cycle. 



1. If Pui{wi) ^ Pui, and Pm2(w2) 7^ Pti2j and then either a spreading state 
is generated, or the differences on components one and two are shifted in 
opposite directions, thus also invading p„. 

2. If ptjj(z«i) — and Pu2{'^2) = Pu2i rnaybe the third component (the 
actual "tests") changes between pu and puiw), but then there is an easy 
way to transmit whole configurations : Alice can simply tell Bob that 
her part is the same as in pu, on the first two components. If Bob does 
the same, then both know both "sets" , and they can check without more 
commmiication if their respective portions of Pusiwz) ever generates a 
spreading state : if so, Pu(w) is invaded, else it is not. 

3. Else, without loss of generality, we can assume that Pmiwi) — Pm and 
Pu2 {w2 ) 7^ Pu2 ■ There are two cases : 

• Either Pusiws) = Pus (the "tests" are the same in p„ and pu{w)), 
and then using the trick from Alice and Bob can know pu^{wi) 
and Pu3{w'i) completely, within constant communication. 

Then, since they each know a part of set Pu2 (""^2 ) , and they both know 
Pu3 {w^), they can check disjointness with pu^ (wi) separately and tell 
if a spreading state ever appears, which is the only way Pu{w) can be 
invaded in this case. 

• If ^113(^3) 7^ Pu-i, then either a spreading state is generated, or 
^143(^3) stays fixed, and Pu2{w2) shifts to infinity: in both cases, 
Pu{w) is invaded. 



4. 4-5- A CA easy for Iny and hard for Pred 

Elementary rule 218 is a natural example exhibiting this property. Unfortu- 
nately, the proof is quite technical and requires an in-depth study of rule 218, 
which we chose to delay until section 16.11 for conciseness of this -already long- 
section, and consistency of section |6l 



□ 
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4-4-6. An CA easy for Cycle and hard for Pred 

We describe the natural example of Rule 33 in Section 16. 3[ which has a 
protocol in constant time for Cycle, and for which any deterministic protocol 
for Pred is in fl(\ogn). 



5. Intrinsic universality: Ruling out complex CA 

Here we show that for two of our canonical problems - namely, Pred and 
Inv - we were able to find a CA of maximal algorithmic complexity (complete), 
and yet very simple with respect to our framework. 

More precisely, we are going to show that, for problems Pred and Inv, there 
exists a CA F for which the communication complexity of the problem is low 
while its classical computational complexity is the highest one can expect. 

Therefore, we are ruling out such non-trivial CA from being intrinsically 
universal. 



5.1. Prediction 



T. Neary and D. Woods proved "the P-completeness of Rule 110" [ll|. In 
our language, they proved that the problem Predf^^q is P-complete. A very 
natural question arises: What do classical algorithmic properties of CA, such 
as P-completeness, imply on their communication complexity counterpart? 

As we show in this section, such a strong computational property is not 
enough to guarantee maximal communication complexity. However, we do not 
know of an automaton that would have, for instance, polylogarithmic commu- 
nication complexity, and still a P-complete prediction problem, nor do we have 
a nonexistence proof. We leave this as an open problem. 

Proposition 12. For any k > 1, there exists a CA F such that 

CC (Predj.) e 0{n^/^) 

and Predj? is P-complete. 

Proof. Let M a Turing machine. We construct a CA F simulating Ai slowly 
but still in polynomial time: it takes n'^' steps of F to simulates n steps of Ai. 
Hence, by a suitable choice of M, the problem of predicting F is P-complete. 

First it is easy to construct a CA simulating A4 in real time. We encode 
each symbol of the tape alphabet of the Turing machine by a CA state, and 
add a "layer" for the head, with '— >■' symbols on its left and ' symbols on its 
right. We guarantee this way that there can be only one head: if a '— state is 
adjacent to a state without a head between them, we propagate a spreading 
"error" state destroying everything. 

We then add a new layer to slow down the simulation: it consists in a single 
particle (we use the same trick to ensure that there is only one particle) moving 
left and right inside a marked region of the configuration. More precisely, it 
goes right until it reaches the end of the marked region, then it adds a marked 
cell at the end and starts to move left to reach the other end, doing the same 
thing forever. Clearly, for any cell in a finite marked region, seeing n traversals 
of the particle takes fl{n^) steps. Then, the idea is to authorize head moves, 
in the previous construction, only at particle traversals. This way, n steps of 
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M require time steps of the automaton. By adding another particle layer, 
one can also slow down the above particle with the same principle and it is not 
difficult to finally construct a CA F such that n steps of A4 require n'' time 
steps of F. We have represented in Figure [TU] the behavior of the particle, with 
the dashed arrow representing a Turing transition. 

Now if the initial configuration does not respect the rules described above, 
then a spreading error state is generated and Alice and Bob can notice it within 
constant communication. In all other cases, it is enough for Alice or Bob to 
know the value of all the 2 • n^/*"' states around the initial position of the head, 
because the computation of the Turing machine simply does not depend on the 
rest of the initial configuration. So for these cases, at most n^^'^ bits need to be 
communicated for Alice or Bob to compute the answer. Note that if the bounds 
for the particle are absent from the initial configuration, then no transition can 
happen, thus Alice and Bob know the result in constant time. 











^ ^ q ^ ^ ^ 





Figure 10; A CA for which Pred is P-complete. 

□ 

Remark. A result by Hromkovic (see f^J) states that a Turing machine with a 
single head working in time t{n) can only recognize a language of communication 
complexity less than 0{^t{n)). Said differently, a CA simulating a Turing ma- 
chine cannot produce instances of communication complexity more than 0{y/n) 
for the prediction problem on configurations with a single head (whatever the 
machine does). 

5.2. Invasion 

This problem is even more complex than Pred: It is in fact undecidable. 
However, since there is no limitation on the "classical" computational power of 
Alice and Bob, it can still be decided within very little communication. 

Proposition 13. 

1. For any CA F and any word u, we have: Inv^ G IIj'. 

2. Their exist F and u such that Inv^ is Hi-complete, and yet CC (Inv^) e 
O(logn) 

Proof. 

1. Let F and u be fixed and consider the problem Inv^. Given an in- 
put xi, . . . , Xn, we use the notations 5i{t) and 5r{t) for the leftmost and 
righmost differences at time t between the orbit of pu and the orbit of 
Pu{xi ■ ■ ■ Xn) as in Definitional 
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Claim. There exists a recursive function /3 such that for any n, any input 
Xi, . . . ,Xn and any A > we have: 

3t, dr{t) - di{t) > A <^ 3t < /3{A), Srit) ^ 6i{t) > A. 

The proof follows from the above claim because the invasion problem can 
be expressed as the following II^ predicate: 

VA >0,3t< /3(A), Sr{t) - Si{t) > A 

V ' 

recursive predicate 

Proof of the claim. First, the orbit of is ultimately periodic: There are 
to and p such that for any t > to we have F*{pu) — F*'^P{pu). Given 
an input xi,. . . ,x„ of the problem, denote by w{t) the word of length 
dr{t) — di{t) starting at position Si{t) in configuration F*{pu{xi, . . . ,x„)). 
The key point is that for any t > to, the triple 

Xit + 1) = {w{t + l),6i{t+ 1) mod \u\,t + l modp) 

is uniquely determined by the triple 

x{t) — {w{t), Si{t) mod \u\,t mod p) 

(because the word w{t) "evolves" in a periodic context and knowing the 
offset of the position of w(t) in that context is enough to know w{t + 1)). 
Therefore, if the words w{t) are bounded by A for a sufficiently long 
time (exponential in A), then the triple x(i) will take a value already 
taken before and the sequence (x(O)t be ultimately periodic, showing 
that 1^(^)1 is bounded and that there is no invasion. Adding to to this 
exponential function is a convenient choice for /3. □ 

2. We build a CA F that simulates a 2-counter machine More precisely, 
standard states have two layers: a data layer over states A, M, B, 0, used 
to store the value of the 2 unary counters, and a control layer made of 
a Turing head storing a state from Q, with the extra — >■ and symbols 
ensuring the uniqueness of the head. Finally, F possesses a blank state 
and a spreading state K to deal with encoding problems. The state set is 
therefore 

(Q U {K, 0, ^}) X {A, B, 0, M}. 

A valid configuration is a configuration everywhere equal to except on 
finite coding segments which have the folloing form (see figure [TT|) : 

• the data layer must be of the form: 0*A+A/i3+0*; 

• the control layer must be of the form: — q -S— + with q (z Q. 



Control layer 


0^^^9^^0 





Data layer 


(ll A A A M B ^ 






Figure 11: A well-formed piece of configuration. Tiie counter A contains value 3 and the 
counter B contains value 1 in this example. 

The number of As and Bs represent the current value of the 2 counters. 
The behaviour of F is the following: 



23 



• If the configuration is not valid (which can be detected locally), then 
the state K is generated and spreads; 

• If the configuration is valid, then on each coding segment, the (nec- 
essarily unique) head goes repeatedly frorn one end of the segment 
to the other end, and extends the segment at each pass by adding a 

on the left (resp. ^ on the right) and a on the data layer. If 
the extension step is blocked by another segment, then the state K 
is generated and spreads; 

• Moreover, at each pass on the segment, the head executes one of the 
basic 2-counter machine's instructions: 

— testing if a counter is empty can be done by checking if there is 
a on the right (resp. the left) of the unique M; 

— decrementing can be done be replacing the leftmost A (resp. 
rightmost B) by a 0: 

— incrementing can be done by replacing a by A on the left of the 
leftmost A (resp. by B on the right of the rightmost B); there 
must be a 0, because the segment is extended at each passage by 
both sides; 

— finally, the head can simply stop. 

If any order given to the head leads to an incoherence (decrement an empty 
counter, write a B when on the 'A' part of the segment, etc), the state K 
is genereated and spreads. 

With this definition, and if m = 0, the halting problem for the 2-counter 

machine encoded in F (input: value of counters; output: docs it halt 
started from these values ?) clearly reduces to Inv^ (halt ■4=4> no inva- 
sion). Therefore, by a suitable choice of the 2-counter machine used to 

construct F, we have that INV^ is Ilj'-complete. 

To conclude the proof, we show that CC (Inv^) G 0(log(n)). Given an 
input w split between Alice and Bob, the following protocol determines 

whether Inv^(w;) = 1: 

• first Alice and Bob check whether the input configuration is valid; 
if not, the answer is 'invasion'; this can be done with 0(1) bits of 
commimication since validity is a local property; 

• the configuration being valid, Alice and Bob communicate so that for 
any pair of consecutive valid segments s\ and S2, either Alice or Bob 
knows the state of both si and S2 and the distance between them; to 
achieve this, even if a segment is split between Alice's part and Bob's 
part, it is sufficient that they communicate 0(log(n)) bits; indeed, a 
segment is completely defined by: 

— the value and position of the head, 

— number of states on the right and the same on the left, 

— number of ^s and number of Bs. 

• since for each pair of valid segment, Alice or Bob as enough informa- 
tion to detect a possible future collision, they can determine together 
with 0(1) bits of communication whether there is invasion or not; in- 
deed, invasion is eqiiivalent to: either their is a collision somewhere, 
or their is a single segment holding a non-halting computation. 

□ 
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5.3. Cycle-length 

For this problem, we could find a CA of maximal algorithmic complexity, 
as shown by the following proposition. However, we have to leave as an open 
problem the existence of a CA F for which both Cycle^ is PSPACE-complete 
for some fc e N, and CC (^Cycle|^ e o{n). 

Proposition 14. 1. For any CA F and any k>l, Cycle|. e pspace. 
2. Their exist F and k such that CycleI^ is PSPACE-complete. 

Proof. 

1. Let F and fc > 1 be fixed. The length of the cycle reached by iterating F 
on a periodic initial configuration c can be determined in polynomial space 
with the algorithm described below. Let n be the period of c. Starting 
from c, the cycle is reached in less than a" steps where a is the cardinal 
of the state set. 

(a) compute cq = F" (c) (memory usage: 0{n)); 

(b) memorize cq and compute the first t such that _F*(co) = cq (memory 
usage: 0(n) because such a t is less than a"). 

2. To show this, we embed a Turing machine M, deciding a PSPACE-complete 
language, in a cyclic configuration for a cellular automaton. A4 works in 
polynomial space, meaning that there is a polynomial P G N[X] such that 
for any x eT*, it will never use more than P(|a;|) tape cells. 




Figure 12: The output of the transducer used in Proposition^] 

We can encode a Turing machine easily into a simple cellular automaton 
F: the states code for the Turing tape cells, and there is a special "head" 
state carrying the state of the machine. It can be easily shown that we can 
encode the transitions of a Turing machine into a local cellular automaton 
rule, ensuring that if there is only one head at the beginning, then it will 
be so during all the computation. 

Moreover, the accepting state is spreading, meaning that if it appears 
somewhere, it spreads over all the configuration in both directions. The 
rejecting state launches a particle erasing the configuration (i.e., writing 
blank states everywhere), but shifting clockwise. In this way, an accepting 
computation will result in period 1, whereas rejecting computations will 
yield periods of the size of the configuration. 
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A polynomial-time transducer can easily encode an input x for M into 
a (cyclic) configuration of F, like shown in figure [T^l It first directly 
translates x into states of F, then computes -P(|a;|) and outputs P{x) 
blank states. 

□ 



6. Intrinsic universality: Ruling out concrete elementary CA 
6.1. CA Rule 218 

The local function /218 : {0, 1}^ {0, 1} of CA rule 218 is defined in Fig- 



ure 
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(a) F218. 
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(b) Example of a space-time diagram for CA Rule 218. 



Figure 13: CA rule 218. 

From the result of fll we already knew that CC (PREDi^^ig) G 0(log(n)). It 
follows from Corollary [2] that Rule 218 is not intrinsically universal. Neverthe- 
less, the proof of J5] was very long and complicated. As we are going to see now, 
the invasion approach gives a short and elegant proof of the same result. 

Definition 7. A word is additive if Is are isolated and separated by an odd 
number of Os. By extension, an infinite configuration is additive if it contains 
only additive words. 

Lemma 2. Additivity is preserved by iterations. Moreover, if abc is additive 
then: 

/2i8(a, 6,c) 7^ /2i8(l - a,b,c) and f2i&{a,b,c) ^ f2i8{a,b, 1 - c). 



Proof. First additivity is preserved by iterations because 010"10 becomes 010" ^10 
forn > 3 and 01010 becomes 000. 
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To conclude the lemma, it is sufficient to check that, for any a,b,c such that 
11 is not a factor of abc then: 

/2i8(a, b, c) ^ /2i8(l - a, b, c) and f2is{a, b, c) ^ /2i8(a, b,l-c). 

□ 

Lemma 3. Let c be any non-additive configuration. Then, after a finite time, 
the word 11 appears in the evolution and this word is a wall. 

Proof. First 11 is a wall because: 

/218(*,1,1) = /218(1,1,*) = 1- 

To conclude it is sufficient to check that the image of 10"1 with n > 2 is 10"~^1. 

□ 

Proposition 15. For all u, we have CC (Inv^.^^^) < 1. 

Proof. First, if the configuration pu is non- additive then, by Lemma [31 at some 
time t a wall appears periodically in ^2i8(^'«)- Hence, for any xi, . . . ,x„, the 
differences between pu{xi, . . . , x„) and Pu are bounded to a fixed finite region. 
Said differently, there is never propagation for such an u. 

Now consider the case where p„ is additive. By Lemma [2j we have for any 

Xi , . . . , Xfi . 

• either p„ = Pu{xi, x„), 

• or for any t > 0: 

6iit) ^ 6i{0) ~ t 
5r{t) = (5^(0) + t 

Therefore, the problem consists in deciding whether p^ and pu{xi, . . . 
are equal, which can be done with 1 bit of communication. □ 

Corollary 4. CA Rule 218 is not intrinsically universal. 

As promised in section |4A5l it remains to show that the deterministic (pos- 
sibly with several rounds) communication complexity of the Pred problem for 
rule 218 is "hard", according to our conventions: 

Proposition 16. 

PREDi?2jg e ri(logn) 
Proof. To show this, we construct a fooling set S'„ (see Definition [1] or Q): 

s'„ = {(l"-'=o^ o'^+ll"-^ <k<n} 

We show that Sn is a fooling set for Rule 218: In fact, on all configurations of 
the form i^-'^o^'^+il"-''^ the result of PREDi?2j^g is always 0. On configurations 
of the form l"~*0'"'"^"'""'^l"~'' where i ^ j, it is always 1. This can be easily shown 
from the collection of lemmas of Q , and we illustrate it on Figure [TH Thus, 
since j^nj = n + 1, we deduce that a deterministic protocol solving PreDj?"^^ 
can not take less than log(n + 1) steps: 

CC (Pred^^^^^J e r!(log(n)) 

□ 
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Figure 14: A configuration of the fooling set Sn for rule 218 

6.2. CA Rule 94 



The local function fgi : {0, 1}^ {0, 1} of CA Rule 94 is defined in Fig- 



ure 
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(a) /94- 




(b) Example of a space-time diagram for CA Rule 94. 
Figure 15: CA Rule 94. 



Here appears clearly how powerful the invasion approach is (as a tool for 
proving non- universality) . Finding an upper bound (a protocol) for CC (PRED^g^) 
seems to be hard. Nevertheless, here we prove in a rather simple way that its 
invasion complexity is logarithmic. 

Definition 8. A configuration is additive if its language is included in ((00)"*" (11)"*") 
(blocks of Os or Is are always of even length). 

Lemma 4. fg^ is bi-permutative when restricted to additive configurations (it 
behaves like /go ) and additive configurations are stable under iterations. 

Proof. For stability of additive configurations, it is sufficient to check that 
00(11)"00 becomes ll(00)"-ill and 11(00)"11 becomes ll(00)"-ill for n > 1. 
/g4 differs from /go only for transition 010, hence bi-permutativity. □ 

Lemma 5. If c is a non-additive configuration which does not contain 010, then 
101 appears after a finite time and it is a wall. More precisely, a wall appears 
after t + 1 steps of CA Rule 94 at the middle of any occurrence of 10'^*+^ 1 or 
Oi2t+3o (^nh t>0). 
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Proof. First 101 is stable under iterations of fg^. Second, 10"1 with n > 2 is 
sent to 10"-2l and 01"0 is sent to lO""^! for n>2. □ 

Lemma 6. The orbit of a configuration c contains a wall if and only if Fg^^c) 
is not additive. 

Proof. From Lemma O it is enough to show that if c is a configuration not 
containing 101 then ^94(0) does not contain 010. For that, it is sufficient to 
check that any word u such that fg4(u) = 010 must contain 101. □ 

From the 2 lemmas above, we get the following proposition. 

Proposition 17. For any u we have CC (Inv^^^) G 0(log(n)). 

Proof. If u is such that the orbit of Pu contains a wall, then invasion never 
occurs. 

If u is such that the orbit of p„ does not contain any wall, then it means 
that i^94(pu) is additive (by Lemma IH]). In this situation, two cases are to be 
considered depending on the input xi, . . . ,a;„. Knowing in which case we are 
can be done within constant number of bits: 

• either Fg4^{pu{xi, . . . , x„)) is also additive and then, by Lemma HI there is 
invasion if and only if Fgj^ipu) = Fq4^{pu{xi, . . . , x„)). This can be decided 
with a finite number of bits of communication. 

• or Fg4^{pu{xi, . . . ^Xn)) is not additive. Then it contains some 10^*~''^1 
or some 01^*+^0 (with t > Q) because, as shown in the proof of lemma, 
if the image of a configuration contains 010, then it must also contain 
101. Consider the leftmost and the rightmost occurrences of this kind of 
words. Since walls appear above the middle of these two occurrences after 
a time equal to half their lengths (Lemma [5]), the fact there is invasion or 
not does not depend on what is beetween that two occurrences. It takes 
0(log(n)) bits of communication for Alice to know the positions of these 
two occurrences and the exact words present at their positions (of type 
10^*+^ 1 or 01^*+^0). Moreover, as soon as Alice knows this she also knows 
that on the left of the leftmost occurrence and on the right of the rightmost 
occurrence, the configuration is additive. If there is one difference with 
p„ in those additive parts, then there is invasion. If not, then Alice has 
got enough information to decide invasion. Deciding in which of the two 
cases we are can be done within constant communication. 

□ 

Corollary 5. CA Rule §4 is not intrinsically universal. 



6.3. CA Rule 33 

We are going to show that this rule, although non-trivial for the Fred 
problem, needs zero communication for the Cycle problem. To show this, 
we prove that the cycle length of Rule 33 is always 2. The local function 
/33 : {0, 1}^ ^ {0, 1} of CA Rule 33 is defined in Figure [16(a) 



Lemma 7. All configurations that do not contain neither 101 (isolated Osj nor 
1001 (isolated OOsJ are stable under {F^^Y . 
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(a) /33. 




(b) Example of a space-time diagram for CA Rule 33. 



Figure 16: CA Rule 33. 

Proof. We call Aq the set of configurations without isolated Os, and Aqq the 
set configuration without isolated 00s. First notice that the only antecedent of 
101 is 10101, which contains an isolated 0, thus Aq is stable under ^33. With 
an exhaustive exploration of all configurations of the form u — abxyzcd where 
xyz e {000 . . . Ill}, and u ^ AqD Aqq, we observe that: 

Vu e Ao n Aoo, |u| = 7, (F33)^(ui . . . W7) = U3U4U5 

□ 

Lemma 8. All (cyclic) configurations of length n, different from (01)L"/^J, do 
not contain isolated Os after [^J steps of CA Rule 33. 

Proof. We already noticed in Lemnia[7]that the only possible antecedent of 101 
is 10101. Thus, there can be an isolated after [^J steps only if there are at 
least [^J isolated Os in the initial configuration, i.e. if the initial configuration 
is (01)L«/2J. 

□ 

Corollary 6. After [^J + 1 steps of CA Rule 33, there are no isolated couples 
ofOs. 

Proof. The only antecedents of 1001 contain an isolated 0. □ 
Corollary 7. After [f J + 1 steps, CA Rule 33 becomes periodic, with period 2. 
Proposition 18. 

CC(Predf33) e n{logn) 

Proof. As usual, we just find a fooling set (see Definition [T]). Consider the 
following set Sn'. 

S'„ = {(l"-2fc(Q^)fcQ^ (10)'^'l"-2fe)^o < k < [n/2\} 
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It can be easily verified that: 

r F3"3(l2"-'=(01)'=0(10)'''l2n-fe) ^ ^ j^qJ 2 

I i^3"3(l2"-*(01)^0(10)n2"-J) = l + (n mod 2) whenever i 7^ j 

Since \Sn\ — [§J , we conclude that a deterministic protocol for predicting 
rule 33 needs D,{logn) bits of communication. 

□ 

7. Conclusion 

We have suggested a method to prove negative results concerning intrinsic 
universality in CA. We have shown that this approach can be used both to 
show that global dynamical properties can imply non-universality, and to rule 
out some concrete cellular automata from being universal. We believe that this 
work should go on in the following directions: 

• It seems that more can be said about the communication complexity prob- 
lems for the class of surjective CA and some of its sub-classes (/c-to-l, 
d-separated, left/right-closing, etc. @); 

• The case of elementary rules 218 and 94 shows that low-cost communi- 
cation protocols can be found in CA that are not linear, but containing 
a linear component 'in competition' with another component. Finding a 
general formalisation for such kind of behaviours could be useful to treat 
many other concrete examples. 

• Concerning concrete CA, ruling out as many elementary rules as possible 
from being intrinsically universal seems to be an interesting (but ambi- 
tious) goal. We could also consider other natural classes of small CA 
(one-way automata, totalistic rules, etc.). 

• The splitting of inputs that induce maximal communication complexity is 
a key parameter, especially for the prediction problem. There is no reason 
for such maximal splittings to be unique, and if it is unique, there is no 
reason to be located in the middle of the input. We suspect that there are 
some links between directional entropy and the evolution of such maximal 
splitting (when increasing the input size). 

• Although completely formalized in dimension 1 , there is no doubt that this 
approach can be adapted to higher dimensions; it could be the occasion to 
adopt other communication complexity models (like the multiparty model) 
and discuss other ways of splitting the input. 
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